WaterBench-Iowa: a large-scale benchmark dataset for data-driven streamflow forecasting
Abstract
This study proposes a comprehensive benchmark dataset for streamflow forecasting, WaterBench-Iowa, that follows FAIR (findability, accessibility, interoperability, and reuse) data principles, is prepared with a focus on convenient use in data-driven and machine learning studies, and provides benchmark performance of state-of-the-art deep learning architectures on the dataset for comparative analysis. By aggregating datasets of streamflow, precipitation, watershed area, slope, soil types, and evapotranspiration from federal agencies and state organizations (i.e., NASA, NOAA, USGS, and the Iowa Flood Center), we provide WaterBench-Iowa for hourly streamflow forecast studies. The dataset has high temporal and spatial resolution with rich metadata and relational information, which can be used for a variety of deep learning and machine learning research. We defined a sample benchmark task of predicting the hourly streamflow for the next 5 d for future comparative studies, and provided benchmark results on this task with sample linear regression and deep learning models, including long short-term memory (LSTM), gated recurrent units (GRU), and sequence-to-sequence (S2S). Our benchmark model results show a median Nash-Sutcliffe efficiency (NSE) of 0.74 and a median Kling-Gupta efficiency (KGE) of 0.79 among 125 watersheds for the 120 h ahead streamflow prediction task. WaterBench-Iowa makes up for the lack of unified benchmarks in earth science research and can be accessed at Zenodo: https://doi.org/10.5281/zenodo.7087806 (Demir et al., 2022a).
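The two skill scores reported in the abstract can be illustrated with a minimal sketch. It assumes the standard definitions: NSE as one minus the ratio of squared error to the variance of the observations, and KGE in its original 2009 form, built from the correlation, the variability ratio, and the bias ratio. The abstract does not say which KGE variant the authors used, so that choice is an assumption. The function names and the sample flow values are hypothetical and are not taken from the dataset.

```python
import math


def nse(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(observed) / len(observed)
    squared_error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    obs_variance = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - squared_error / obs_variance


def kge(observed, simulated):
    """Kling-Gupta efficiency (2009 form, assumed here): 1 is a perfect fit."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_sim = sum(simulated) / n
    std_obs = math.sqrt(sum((o - mean_obs) ** 2 for o in observed) / n)
    std_sim = math.sqrt(sum((s - mean_sim) ** 2 for s in simulated) / n)
    covariance = sum(
        (o - mean_obs) * (s - mean_sim) for o, s in zip(observed, simulated)
    ) / n
    r = covariance / (std_obs * std_sim)  # linear correlation
    alpha = std_sim / std_obs             # variability ratio
    beta = mean_sim / mean_obs            # bias ratio
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)


# Hypothetical hourly flows, for illustration only.
observed_flow = [1.0, 2.0, 3.0, 4.0]
doubled_flow = [2.0 * q for q in observed_flow]

print(nse(observed_flow, observed_flow))  # perfect forecast
print(kge(observed_flow, doubled_flow))   # correlated but biased forecast
```

In a benchmark of this kind, each score would be computed per watershed and the median taken across watersheds. Note that `kge` is undefined for a constant forecast, because the simulated standard deviation is zero.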
Journal
Journal title: Earth System Science Data
Year: 2022
ISSN: 1866-3516, 1866-3508
DOI: https://doi.org/10.5194/essd-14-5605-2022